Existence and Uniqueness Theorems for ODEs
Khalil (Section 3.1), Sastry (Section 3.4) 


$$\dot{x} = f(t,x), \qquad x(0) = x_0 \tag{1}$$
Theorem 1: $f(t,x)$ locally Lipschitz in $x$ and continuous in $t$
$\Rightarrow$ existence and uniqueness on some finite interval $[0,\delta]$.


Sketch of the proof: From the local Lipschitz assumption, we can
find $r > 0$ and $L > 0$ such that

$$|f(t,x) - f(t,y)| \le L|x - y| \quad \forall x, y \in \{x \in \mathbb{R}^n : |x - x_0| \le r\}.$$


If $x(t)$ is a solution, then:

$$x(t) = \underbrace{x_0 + \int_0^t f(\tau, x(\tau))\,d\tau}_{=:\,T(x)(t)}.$$
To apply the Contraction Mapping Theorem: 


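1. Choose $\delta$ small enough that $T$ maps the following subset of
$C^n[0,\delta]$ to itself:

$$U = \{x \in C^n[0,\delta] : |x(t) - x_0| \le r \ \ \forall t \in [0,\delta]\},$$
i.e.,
$$|x(t) - x_0| \le r \ \ \forall t \in [0,\delta] \ \Rightarrow\ |T(x)(t) - x_0| \le r \ \ \forall t \in [0,\delta]. \tag{2}$$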
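To find such a $\delta$ note that
$$T(x)(t) - x_0 = \int_0^t f(\tau, x(\tau))\,d\tau = \int_0^t \big(f(\tau, x(\tau)) - f(\tau, x_0) + f(\tau, x_0)\big)\,d\tau$$
$$\begin{aligned}
|T(x)(t) - x_0| &\le \int_0^t |f(\tau, x(\tau)) - f(\tau, x_0)|\,d\tau + \int_0^t |f(\tau, x_0)|\,d\tau \\
&\le \int_0^t L|x(\tau) - x_0|\,d\tau + \int_0^t h\,d\tau \quad \text{where $h$ is a bound on $|f(t,x_0)|$} \\
&\le (Lr + h)\delta.
\end{aligned}$$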


Thus, by choosing $\delta \le \frac{r}{Lr+h}$, we ensure that the implication (2)
holds.


2. Show that $T$ is a contraction in $U$, i.e., there exists $\rho < 1$ s.t.

$$x, y \in U \ \Rightarrow\ |T(x) - T(y)|_C \le \rho\,|x - y|_C.$$

Note that, for all $t \in [0,\delta]$,
$$\begin{aligned}
|T(x)(t) - T(y)(t)| &\le \int_0^t |f(\tau, x(\tau)) - f(\tau, y(\tau))|\,d\tau \\
&\le \int_0^t L|x(\tau) - y(\tau)|\,d\tau \\
&\le \underbrace{L\delta}_{=:\,\rho}\ \max_{t \in [0,\delta]} |x(t) - y(t)| = \rho\,|x - y|_C.
\end{aligned}$$


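Therefore,

$$|T(x) - T(y)|_C = \max_{t \in [0,\delta]} |T(x)(t) - T(y)(t)| \le \rho\,|x - y|_C$$

and $\rho = L\delta < 1$ if $\delta \le \frac{r}{Lr+h}$ as prescribed above (this uses $h > 0$; if $h = 0$, take the inequality strict).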
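The fixed-point construction in the proof sketch can be tried numerically. The following is a minimal sketch, not part of the original notes: it assumes the hypothetical scalar example $\dot{x} = -x$, $x(0) = 1$ (so $L = 1$), a uniform grid on $[0,\delta]$ with $\delta = 0.5$, and trapezoidal quadrature for the integral in $T$. The helper names `picard_step` and `sup_dist` are illustrative.

```python
import math

def picard_step(f, x0, ts, xs):
    """Apply T(x)(t) = x0 + integral_0^t f(tau, x(tau)) dtau on a grid (trapezoid rule)."""
    out = [x0]
    acc = 0.0
    for k in range(1, len(ts)):
        dt = ts[k] - ts[k - 1]
        acc += 0.5 * dt * (f(ts[k - 1], xs[k - 1]) + f(ts[k], xs[k]))
        out.append(x0 + acc)
    return out

def sup_dist(xs, ys):
    """Sup-norm distance between two grid functions."""
    return max(abs(a - b) for a, b in zip(xs, ys))

# Assumed example: f(t, x) = -x, so L = 1. With r = 1 and h = |f(t, x0)| = 1,
# the bound r / (L r + h) equals 0.5, which is the delta used here.
f = lambda t, x: -x
x0, delta, n = 1.0, 0.5, 200
ts = [delta * k / n for k in range(n + 1)]

x = [x0] * (n + 1)   # initial guess: the constant function x0, which lies in U
gaps = []
for _ in range(8):
    x_next = picard_step(f, x0, ts, x)
    gaps.append(sup_dist(x_next, x))   # distance between successive iterates
    x = x_next

ratios = [gaps[k + 1] / gaps[k] for k in range(len(gaps) - 1)]
err = sup_dist(x, [math.exp(-t) for t in ts])
print("successive gaps:", gaps)
print("max ratio:", max(ratios), "error vs exp(-t):", err)
```

The ratios of successive gaps stay below $\rho = L\delta = 0.5$, as the contraction estimate predicts, and the iterates approach the exact solution $e^{-t}$ up to the quadrature error of the grid.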


Theorem 2: $f(t,x)$ globally Lipschitz in $x$ uniformly in $t$ (i.e., the same $L$ works for all $t$), and continuous
in $t$ $\Rightarrow$ existence and uniqueness on $[0,\infty)$.


Proof: Choose a $\delta$ that doesn’t depend on $x_0$ and apply Theorem 1 repeatedly
to cover $[0,\infty)$. This is possible because $L$ works everywhere
and we can pick $r$ as large as we wish. Indeed, for any $\delta < \frac{1}{L}$ we can
choose $r$ large enough that $\delta \le \frac{r}{Lr+h}$.


Q: Why can’t we do this in Theorem 1? 


A: $\delta$ depends on $x_0$ (no universal $L$), and $x_0$ changes at the next iteration.
We can’t use the same $\delta$ in every iteration, so the successive intervals may
shrink and need not cover $[0,\infty)$.


• The theorems above are sufficient only, and can be conservative:

Example: $\dot{x} = -x^3$ is not globally Lipschitz but

$$x(t) = \operatorname{sgn}(x_0)\sqrt{\frac{x_0^2}{1 + 2t x_0^2}}$$

is defined on $[0,\infty)$.
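The closed-form solution in this example can be checked against a numerical integration. The sketch below is an illustration only: the fixed-step RK4 integrator, the step count, the horizon $t = 50$, and the three initial conditions are assumptions chosen for the check, not values from the notes.

```python
import math

def rk4(f, x0, t_end, n):
    """Classical fixed-step RK4 for a scalar ODE x' = f(t, x) on [0, t_end]."""
    h = t_end / n
    t, x = 0.0, x0
    for _ in range(n):
        k1 = f(t, x)
        k2 = f(t + h / 2, x + h / 2 * k1)
        k3 = f(t + h / 2, x + h / 2 * k2)
        k4 = f(t + h, x + h * k3)
        x += h / 6 * (k1 + 2 * k2 + 2 * k3 + k4)
        t += h
    return x

def closed_form(t, x0):
    """x(t) = sgn(x0) * sqrt(x0^2 / (1 + 2 t x0^2))."""
    return math.copysign(math.sqrt(x0 ** 2 / (1 + 2 * t * x0 ** 2)), x0)

f = lambda t, x: -x ** 3   # not globally Lipschitz, yet solutions exist for all t >= 0

errors = {}
for x0 in (-3.0, 0.5, 5.0):
    errors[x0] = abs(rk4(f, x0, 50.0, 50000) - closed_form(50.0, x0))
print(errors)
```

For each initial condition the numerical solution stays bounded and agrees with the formula, consistent with the solution being defined on all of $[0,\infty)$ even though Theorem 2 does not apply.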


Continuous Dependence on Initial Conditions and Parameters 


Theorem 3: (Continuous dependence on initial conditions) Let
$x(t)$, $y(t)$ be two solutions of $\dot{x} = f(t,x)$ starting from $x_0$ and $y_0$,
and remaining in a set with Lipschitz constant $L$ on $[0,T]$. Then, for
any $\epsilon > 0$, there exists $\delta(\epsilon,T) > 0$ such that

$$|x_0 - y_0| \le \delta \ \Rightarrow\ |x(t) - y(t)| \le \epsilon \quad \forall t \in [0,T].$$
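One standard way to obtain an explicit $\delta(\epsilon,T)$ is the Gronwall-Bellman inequality. The estimate below is the usual textbook argument rather than something spelled out in these notes, and it assumes both solutions stay in the set where the Lipschitz constant $L$ is valid.

```latex
\begin{align*}
|x(t) - y(t)| &\le |x_0 - y_0| + \int_0^t |f(\tau,x(\tau)) - f(\tau,y(\tau))|\,d\tau \\
              &\le |x_0 - y_0| + L\int_0^t |x(\tau) - y(\tau)|\,d\tau \\
\Longrightarrow\quad
|x(t) - y(t)| &\le |x_0 - y_0|\,e^{Lt} \;\le\; |x_0 - y_0|\,e^{LT},
\qquad t \in [0,T].
\end{align*}
```

Under these assumptions $\delta(\epsilon,T) = \epsilon\,e^{-LT}$ is one admissible choice. Note that this $\delta$ shrinks to zero as $T \to \infty$.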


EE C222/ME C237 - SPRING’18 - LECTURE 7 NOTES 


• This conclusion does not hold on infinite time intervals (even if $f$
is globally Lipschitz).


Example: bistable system 


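If $\epsilon$ is smaller than the distance between the two stable equilibria, no
choice of $\delta$ guarantees $|x(t) - y(t)| \le \epsilon$ $\forall t \ge 0$: initial conditions
arbitrarily close to each other, but on opposite sides of the boundary between the
two regions of attraction, converge to different equilibria.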
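The bistable behavior can be illustrated numerically. The sketch below assumes the hypothetical scalar system $\dot{x} = x - x^3$ (stable equilibria at $\pm 1$, unstable equilibrium at $0$), which is not necessarily the system pictured in the lecture. The helper `rk4_final`, the horizons $1$ and $40$, and the initial gaps are illustrative choices.

```python
def rk4_final(f, x0, t_end, n=20000):
    """Fixed-step RK4 for an autonomous scalar ODE x' = f(x); returns x(t_end)."""
    h = t_end / n
    x = x0
    for _ in range(n):
        k1 = f(x)
        k2 = f(x + h / 2 * k1)
        k3 = f(x + h / 2 * k2)
        k4 = f(x + h * k3)
        x += h / 6 * (k1 + 2 * k2 + 2 * k3 + k4)
    return x

# Assumed bistable example: stable equilibria at -1 and +1, unstable at 0.
f = lambda x: x - x ** 3

gap_short = {}   # |x(1) - y(1)| for initial conditions +d/2 and -d/2
gap_long = {}    # |x(40) - y(40)| for the same initial conditions
for d in (1e-2, 1e-4, 1e-6):
    gap_short[d] = abs(rk4_final(f, d / 2, 1.0) - rk4_final(f, -d / 2, 1.0))
    gap_long[d] = abs(rk4_final(f, d / 2, 40.0) - rk4_final(f, -d / 2, 40.0))

print("gap at t = 1: ", gap_short)   # shrinks in proportion to d
print("gap at t = 40:", gap_long)    # stays near 2 regardless of d
```

On the short horizon the gap scales with the initial distance, as Theorem 3 guarantees. On the long horizon it settles near $2$, the distance between the stable equilibria, no matter how small the initial gap is.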


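• Theorem 3 also shows continuous dependence on the parameter $\mu$ in
$f(t,x,\mu)$ if we rewrite the system equations as:

$$\begin{aligned} \dot{x} &= f(t,x,\mu) \\ \dot{\mu} &= 0 \end{aligned}
\qquad\Longleftrightarrow\qquad
X := \begin{bmatrix} x \\ \mu \end{bmatrix}, \quad
\dot{X} = F(t,X) := \begin{bmatrix} f(t,x,\mu) \\ 0 \end{bmatrix},$$

where $\mu$ appears as a state variable with initial condition $\mu(0) = \mu$.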


Q: How do you reconcile bifurcations with continuous dependence
on parameters? We could pick two values of the bifurcation parameter
arbitrarily close, but one below and one above the critical value,
thereby expecting a drastic difference in the solutions.


A: The two solutions are close in the short term (Theorem 3 holds on 
finite time intervals); the drastic difference builds up over time. 


Sensitivity to Parameters 


Consider the system 

$$\dot{x} = f(t,x,\mu), \qquad x \in \mathbb{R}^n,\ \mu \in \mathbb{R}^p \tag{3}$$
where $\mu$ is a vector of $p$ parameters, and let $\phi(t,x_0,\mu)$ denote the
trajectory starting at the initial condition $x_0$.


To determine to what extent this trajectory depends on the parameters
we define the $n \times p$ sensitivity matrix:
$$S(t,x_0,\mu) := \frac{\partial \phi(t,x_0,\mu)}{\partial \mu} = \begin{bmatrix} \dfrac{\partial \phi(t,x_0,\mu)}{\partial \mu_1} & \cdots & \dfrac{\partial \phi(t,x_0,\mu)}{\partial \mu_p} \end{bmatrix} \tag{4}$$
where each column is the sensitivity with respect to a particular
parameter.


To see how $S(t,x_0,\mu)$ can be computed numerically, first note that
$\phi(t,x_0,\mu)$ satisfies the equation (3), that is,

$$\frac{\partial \phi(t,x_0,\mu)}{\partial t} = f(t, \phi(t,x_0,\mu), \mu).$$


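Next, differentiate both sides with respect to $\mu$:
$$\frac{\partial^2 \phi(t,x_0,\mu)}{\partial t\,\partial \mu} = \frac{\partial f}{\partial x}(t,\phi(t,x_0,\mu),\mu)\,\frac{\partial \phi(t,x_0,\mu)}{\partial \mu} + \frac{\partial f}{\partial \mu}(t,\phi(t,x_0,\mu),\mu)$$
and use the definition of the sensitivity matrix to rewrite this as
$$\frac{\partial S(t,x_0,\mu)}{\partial t} = \frac{\partial f}{\partial x}(t,\phi(t,x_0,\mu),\mu)\,S(t,x_0,\mu) + \frac{\partial f}{\partial \mu}(t,\phi(t,x_0,\mu),\mu).$$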

Thus, $S$ can be computed by numerical integration of (3) simultaneously
with

$$\dot{S} = \frac{\partial f}{\partial x}(t,x,\mu)\,S + \frac{\partial f}{\partial \mu}(t,x,\mu).$$

The initial condition for $S$ is $S(0) = 0$, assuming that $x_0$ is independent
of the parameters.


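Example: For the harmonic oscillator

$$\dot{x}_1 = \mu x_2, \qquad \dot{x}_2 = -\mu x_1$$

we have

$$\frac{\partial f}{\partial x} = \begin{bmatrix} 0 & \mu \\ -\mu & 0 \end{bmatrix}, \qquad \frac{\partial f}{\partial \mu} = \begin{bmatrix} x_2 \\ -x_1 \end{bmatrix}.$$

Thus the sensitivity equation is

$$\dot{S} = \begin{bmatrix} 0 & \mu \\ -\mu & 0 \end{bmatrix} S + \begin{bmatrix} x_2 \\ -x_1 \end{bmatrix}.$$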
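Integrating the state and its sensitivity together can be sketched in a few lines. The code below assumes the oscillator is $\dot{x}_1 = \mu x_2$, $\dot{x}_2 = -\mu x_1$ with a scalar parameter $\mu$, so that $S = (s_1, s_2)$ obeys $\dot{s}_1 = \mu s_2 + x_2$, $\dot{s}_2 = -\mu s_1 - x_1$. The values $\mu = 2$, $x(0) = (1, 0)$, the horizon $t = 3$, and the fixed-step RK4 integrator are illustrative assumptions.

```python
import math

def rhs(z, mu):
    """Augmented dynamics: state (x1, x2) and sensitivity (s1, s2) = d(x1, x2)/d(mu)."""
    x1, x2, s1, s2 = z
    return [
        mu * x2,          # x1' =  mu * x2
        -mu * x1,         # x2' = -mu * x1
        mu * s2 + x2,     # s1' = (df/dx S)_1 + (df/dmu)_1
        -mu * s1 - x1,    # s2' = (df/dx S)_2 + (df/dmu)_2
    ]

def rk4(z0, mu, t_end, n=2000):
    """Fixed-step RK4 on the augmented system; returns the final augmented state."""
    h = t_end / n
    z = list(z0)
    for _ in range(n):
        k1 = rhs(z, mu)
        k2 = rhs([a + h / 2 * b for a, b in zip(z, k1)], mu)
        k3 = rhs([a + h / 2 * b for a, b in zip(z, k2)], mu)
        k4 = rhs([a + h * b for a, b in zip(z, k3)], mu)
        z = [a + h / 6 * (p + 2 * q + 2 * r + s)
             for a, p, q, r, s in zip(z, k1, k2, k3, k4)]
    return z

mu, t_end = 2.0, 3.0
z0 = [1.0, 0.0, 0.0, 0.0]            # x(0) = (1, 0) and S(0) = 0
x1, x2, s1, s2 = rk4(z0, mu, t_end)

# Independent check: central finite difference of the trajectory in mu.
eps = 1e-5
zp = rk4(z0, mu + eps, t_end)
zm = rk4(z0, mu - eps, t_end)
fd = [(zp[0] - zm[0]) / (2 * eps), (zp[1] - zm[1]) / (2 * eps)]

print("sensitivity ODE:  ", (s1, s2))
print("finite difference:", fd)
```

For this initial condition the exact solution is $x_1 = \cos\mu t$, $x_2 = -\sin\mu t$, so the sensitivities are $-t\sin\mu t$ and $-t\cos\mu t$, which gives a second way to check the integration.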


Logarithmic Sensitivity 


To compare the sensitivity with respect to multiple parameters
$\mu_1, \dots, \mu_p$ it is preferable to use the logarithmic sensitivity

$$\frac{\partial \phi(t,x_0,\mu)}{\partial \mu_i / \mu_i} = \frac{\partial \phi(t,x_0,\mu)}{\partial \ln \mu_i}$$

so that the denominator is dimensionless and represents the change
in the parameter $\mu_i$ relative to its nominal value. This means that
the $i$th column of the sensitivity matrix $S$ in (4) must be multiplied
by the nominal parameter $\mu_i$, $i = 1,\dots,p$, before these columns are
compared for the relative significance of the parameters.
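The column scaling can be written out directly. In the sketch below the matrix `S`, the nominal values `mu`, and the helper name `log_sensitivity` are made-up illustrations, chosen so that the two parameters differ by several orders of magnitude.

```python
def log_sensitivity(S, mu):
    """Scale column i of the n x p matrix S (a list of rows) by the nominal value mu[i]."""
    return [[s_ij * mu_j for s_ij, mu_j in zip(row, mu)] for row in S]

# Hypothetical numbers: two states, two parameters of very different magnitude.
S = [[0.5, 200.0],
     [0.1, -50.0]]
mu = [100.0, 0.01]

S_log = log_sensitivity(S, mu)
print(S_log)
```

In the raw matrix the second column has the larger entries. After scaling, the first column dominates, so in this made-up case a given relative change in the first parameter moves the trajectory more than the same relative change in the second.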
Application to Parameter Tuning and Identification


Sensitivity equations are useful for solving a class of optimization
problems of the form

$$\min_{\mu}\ J(\mu) := \int_0^T q(t, x(t), \mu)\,dt \quad \text{subject to } \dot{x} = f(t,x,\mu),\ x(0) = x_0.$$


For example, one may take $q(t,x) = |h(x(t)) - r(t)|^2$ to penalize the
error between the output $y(t) = h(x(t))$ of a control system and a
reference trajectory $r(t)$ to be followed. In this example $\dot{x} = f(t,x,\mu)$
represents the closed loop model with tunable control parameters $\mu$.

In other applications $\dot{x} = f(t,x,\mu)$ may represent the model of a
physical process with unknown parameters, and $q(t,x) = |h(x(t)) - r(t)|^2$
penalizes the error between the model prediction for a variable,
$y(t) = h(x(t))$, and the experimental observation $r(t)$. Then the
optimization problem above aims to find parameters that best fit the
experimental data.


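A typical optimization algorithm requires the gradient $\frac{\partial J(\mu)}{\partial \mu}$, which
can be obtained with the help of the chain rule and the sensitivity
equations:

$$\frac{\partial J(\mu)}{\partial \mu} = \int_0^T \left( \frac{\partial q}{\partial x}(t, x(t), \mu)\,S(t, x_0, \mu) + \frac{\partial q}{\partial \mu}(t, x(t), \mu) \right) dt.$$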
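A small identification problem shows how the gradient can be accumulated alongside the state and sensitivity. The sketch below rests on assumptions that are not in the notes: the scalar model $\dot{x} = -\mu x$, $x(0) = 1$, the reference $r(t) = e^{-t}$ (so the true parameter is $\mu = 1$), the cost $q = (x - r)^2$ with no explicit dependence on $\mu$, the horizon $T = 5$, and a plain gradient descent with a hand-picked step size.

```python
import math

def rhs(t, z, mu):
    """z = (x, S, J, G): state, sensitivity, running cost, running gradient."""
    x, S, J, G = z
    r = math.exp(-t)              # assumed reference, generated by mu = 1
    return [
        -mu * x,                  # x' = f(t, x, mu) = -mu * x
        -mu * S - x,              # S' = (df/dx) S + df/dmu
        (x - r) ** 2,             # J' = q(t, x)
        2.0 * (x - r) * S,        # G' = (dq/dx) S, since dq/dmu = 0 here
    ]

def cost_and_gradient(mu, T=5.0, n=1000):
    """Integrate the augmented system with fixed-step RK4; return (J, dJ/dmu)."""
    h = T / n
    t, z = 0.0, [1.0, 0.0, 0.0, 0.0]   # x(0) = 1, S(0) = 0, J(0) = G(0) = 0
    for _ in range(n):
        k1 = rhs(t, z, mu)
        k2 = rhs(t + h / 2, [a + h / 2 * b for a, b in zip(z, k1)], mu)
        k3 = rhs(t + h / 2, [a + h / 2 * b for a, b in zip(z, k2)], mu)
        k4 = rhs(t + h, [a + h * b for a, b in zip(z, k3)], mu)
        z = [a + h / 6 * (p + 2 * q + 2 * r + s)
             for a, p, q, r, s in zip(z, k1, k2, k3, k4)]
        t += h
    return z[2], z[3]

# Compare the sensitivity-based gradient with a central finite difference of J.
J0, g0 = cost_and_gradient(2.0)
eps = 1e-5
fd = (cost_and_gradient(2.0 + eps)[0] - cost_and_gradient(2.0 - eps)[0]) / (2 * eps)
print("gradient via sensitivity:", g0, " finite difference:", fd)

# Plain gradient descent with a fixed, hand-picked step size.
mu = 2.0
for _ in range(50):
    mu -= 2.0 * cost_and_gradient(mu)[1]
print("estimated mu:", mu)
```

The sensitivity-based gradient needs one integration of the augmented system per evaluation, whereas finite differences need at least one extra integration per parameter. That difference is the usual reason to prefer sensitivity equations when $p$ is large.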


